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Abstract 


Digital tiansmission stoiage and repioduction of audio signals as effectiveb as possible 
needs idaptation to the perceptual characteiistics of the human auditory svstfin Tins is 
best accomplished in the fiequency domain A Tiansfoim Coder based on the Dis(itt( 
Fouiiei Timsfoim is implemented Peiceptual paiameteis lelevant to loy bil latt (oding 
are computed for a segment of sampled audio signal 
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Chapter 1 


Introduction 

1 1 Need Of Low Bit Rate Coders 

Digital sound piocessmg and storage adopted m audio aie providing excellent sound ipiil 
itj Hoveici converting an audio signal to a 16 bit digital format with appiopiiatf irdnn 
danf> foi ciior correction and with a minimum sampling rate requires extremch e\1( nek d 
bandvi idtli for signal transmission and storage the latter coupled with huge m iss stoi ig( 
necessities The large bandwidth results m pioblems for ladio tiansmission m paiticulu 
so tlicio IS considerable interest m avoiding anj redundancy in the signal other thin foi 
error coiiectron purposes To achieie sound transmission or reproduction that is nol onh 
\ei\ good but also efficient all equipment has to be adapted to the chaiacteiisLu s of tin 
final rccenei in this case the human auditory system Anj part of the transmitted sign d 
that is not recognized by the auditory system shows bad matching to the reeeireis and 
pio\ides necessary ledundancy Therefore digital audio coelers are designed to lediue flu 
ledundanci and ii relevancy in a signal for the purpose of bit rate reduction Th it is digit d 
waveform codeis[4] seek to be in a region where digitized signals are peiceptually ule\ int 
and statiscally nonredundant 

Low bit rate coders if available with the requisite performance could be us(cl in i 
number of places such as for remote broadcast lines studio links satellite transmission 
of high quality audio and the like For instance a channel with 15 KHz bandwidth Ih it 
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IS ppueptualh tiansparent would be \en useful to broadcast media who could use it to 
pio\ icle a lemote link liee of the noise and fiequenc'v response pioblems noirnxlh assoc latcd 
with Icised and/oi equalized lines 

Focus on widei bandwidth signals of 15 IvH/ oi 20 KHz has centcied pnin uih on l)it 
idte 1 eduction techniques The perceptual Tiansfoim Codei [3] desciibed in this thesis uses 
a laige encoding delay and has considerably gicater complexiH but piovides trausp iient 
coding of 1 wide class of 15 KHz bandwidth audio signals at 4 bits pei sample 

1 2 Overview Of The Thesis 

111 this thesis we ha\e attempted to gam an undeistandmg of Perceptual Tiansfoim Coding 
concept by implementing a DFT based Tiansform Coder and computing some of the 
rc lc\ mt peiceptual parameters of a segment of sampled audio data The coding algorithm 
IS based on Johnston s papei[3] 

111 the next chaptei we leview the ISO/MPEG Audio standard and chaplei 3 i(\iews 
about the basic psychoacoustic piopeity exploited m peiceptual audio coding Chaptei 4 
describes the implementation of DFT based Tiansform Coder Chapter 5 discusses diiof 
tions in Digital Audio Broadcastmg(DAB) 


2 


Chapter 2 


ISO/MPEG Audio Standard 

2 1 MPEG Audio Compression Standard 

Digit il coded audio lequiiiiig low bit lates has led the ISO/IEC(Iiiteinational Oigaiiuatioii 
foi Staiidaidization/Intei national Electiotechnical Commission) standardization IjodA to 
estiblish ]\IPEG(Moving Pictuies Expcits Group) standards for audio and Mcleo (odiiif, 
MPEG A-iidio standard specifies the coded lepicsentation of high ciuality audio foi stoi if,e 
media and the method foi decodmg[8] The compression techniques use psycho u oiistic 
models for predicting the human auditory response to the noise that is mtiodnced In the 
coding scheme Using these models the characteristics of the compression schciiK ( an be 
changed djnamicall} m order to minimize the audibility of these noise imp urnients 

The MPEG Audio standard is intended for application to digital storage medi > sin h as 
Compact Disc(CD) Digital Audio Tape(D\T) and magnetic hard disc The stoi age me dia 
maa either be connected directly to the decoder or a la other means such as commiimc itioii 
lines MPEG Audio standard is intended for sampling rates of 32 KHz 44 1 KH/ iiid 48 
KHz at bit rates in a range of 32 192 Kbits per sec per mono and 64 384 Kbits pe i see 
per stereo luclio channel 

Depending on the application three different layers of the coding system with me le asing 
coder complexity and performance can be used 

Layer I This layer contains the basic mapping of the digital audio input into 32 sub 
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bands fixed segmentation to foimat the data into blocks a ps}choacousti( mod( 1 to 
detciminc the adaptne bit allocation and quantization using block comp indin^ md 
foimattmg This layci is most appiopriate foi consumei applications such is digital 
home lecording on tapes that is for those applications for which vei\ low d ita iat( s 
ai( not manclatoiv 

Layer II This layer intioduces fuithci compression with le&pect to La-vei I b^ icdund nice 
and iiiGlG\ance lemoval on scale factors and uses nioie precise quantization This 
la'\ci has niimeioiis applications m both consumer and piofcssional luclio such is 
Digital Audio Bioadcasting(DA.B) recording telecommunication and inultimc di i 

Layer III This la’vei introduces increased ficciuencv resolution brsed on a h\biid filtti 
bail! It adds nonumfoim quantization adaptne segmentation and eiitiop-^ coding 
of the quantized values for better coding efhciency This layer is most appiopiiatc 
foi telecommunication applications m particular with nairowband ISDA and foi 
professional audio having high weights on very low bit lates 

2 2 Basic Structure Of ISO/MPEG/ 

Audio Encoder 

The bisic structure of a perceptual audio coding siheme[6] is shown in Fig 2 1 
Fig 2 1(a) illustrates the basic structure of ISO/MPEC/ ALudio Encoder 
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Figuie 2 1 (a) Audio Encoder (b) Audio Decoder 


It consists of 

1 A. time fiequency mapping (filtei bank) is used to decompose the input signal into 
subsainpled spectial components Depending on the filtei bank used these au called 
subband \alues oi frequenc} lines 

2 The output of this filtei bank oi the output of a paiallel transfoim is use el to 
calculate an estimate of the actual(t]me dependent) noise masking threshold using 
lules known fiom ps^v choacoustics 

3 The subband samples oi frequency lines are quantized and coded with the run of 
keeping the noise which is intioduced by quantizing below the masking thieshold 

4 A flame packing is used to assemble the bit stream which typically consists of the 
qu rntized and coded mapped samples and some side information such as bit allot i 
tion information 
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This thesis focuses on high frequency resolution which leads to onlj limited tune lesolution 
Such S'! stems aie usualh called Perceptual Tiansform Coders 


2 3 Basic Structure Of ISO /MPEG/ 

Audio Decoder 

Fig 2 1(b) illustrates the basic structure of ISO/MPEG/ Audio Decoder Bit stieam data 
lb fed into the decoder The bit stream unpacking and decoding block does eiioi detection 
if eiioi ch( ek is applied in the encoder The bit stieam aie unpacked to lecoiei the \ uioiis 
pieces of mfoi matron The reconstruction block leconstiucts the quantized veision of the 
set of rnipped samples The fiequencv time mapping transforms these mappe d simples 
bick into uniform PCM 


Chapter 3 


Psychoacoustics And Perceptual 
Coding 

3 1 Auditory Masking 

Most of the time oui woild presents us with a multitude of sounds simultanoush It is 
often difficult to heai one sound when a much louclei sound is present This pioccss sf (ins 
mtutnc but on the psycho ic oustic and cognitnc levels it becomes \ei'v compl(\ Tlu 
teim foi this process is masking Masking is defined as the amount bj which tlu thnshold 
of ludibility foi one sound is raised bv the presence of another (masking) sound [10] Tin 
basic idea in the psychoacoustic approach is to determine which pait of the noise is irnsl c d 
by the sigiirl and which pait is audible 

Masling expeiiments using steady state signals have led to the conclusion tint oui 
luditoiv system perfoims a spectral analysis which can be modeled by a eontiuuous hltei 
bulk of bandpass filters having a bandwidth of 100 Hz for frequencies beloi\ 500 H/ iiiel 
a bandwidth of approximately one sixth of the center frequency[7] above 500 Hz Tli it is 
OUI auditoi\ S3^stem analyses in parts that correspond to critical bands A. list of eiiticil 
band edge fieciuencies is given m Table 3 1 
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Band No 

Hz 

Lower Edge 

Hz 

Center 

H 

Upper Edge 

Hz 

1 

0 

50 

100 

2 

100 

150 

200 

3 

200 

2b0 

300 

4 

300 

350 

400 

5 

400 

450 

510 

6 

510 

570 

630 

7 

630 

700 

770 

8 

770 

810 

920 

9 

920 

1000 

1080 

10 

1080 

1170 

1270 

11 

1270 

1370 

1480 

12 

1480 

1600 

1720 

13 

1720 

1850 

2000 

14 

2000 

2150 

2320 

15 

2320 

2500 

2700 

16 

2700 

2900 

3150 

17 

3150 

3400 

3700 

18 

3700 

4000 

4400 

19 

4400 

4800 

5300 

20 

5300 

5800 

6400 

21 

6400 

7000 

7700 

22 

7700 

8500 

9500 

23 

9500 

10500 

12000 

24 

12000 

13500 

15500 


Table 3 1 A List Of Critical Band Center And Edge Frequencies 
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Fiom the Table 3 1 it is obvious that one critical band is added to the next so that 
the iippei limit of the lover critical band coiiesponds to the lower limit of the next higlni 
ciiticdl band thus producing the scale of the cntiral band late The critical bind i itc 
scale mentioned[l][5] follows a lineai scale up to about oOO Hz and then a logdiitlimic 
frequency scale above 500 Hz 

Fig 3 1(a) shows the uncoiled innei cai including the basilar membrane It iiidu itcs 
that the ciitical band late scale is directly i elated to the place along the basilai rnc nibianc 
when all the scnsoiy cells aic located in a veiy c quidistant configuiation Thus the ( iitn al 
band lale scale is closeH lelatcd to oui physiology too Fig 3 1(b) and Fig 3 1(( ) shows 
the dependence of ciitical band late on fiequenc> as illustiated The ciitical band concept 
has led to the definition of the hark scale as the psychoacoustic equivalent of the hcciucnci 
scale 


3 2 Psychoacoustic Models 

The MPEG audio standard is able to maintain CD audio quality reproduction up to a 
compression ratio of 5 to 1 oi better This bit rate coiresponds to encoding each audio 
sample with an ateiage of 3 to 4 bits pei sample The MPEG coding scheme acliit ics this 
coinpiession by placing the quantization noise in those regions of the fiequenca spec ti urn 
wdieu the human ear is least sensitive 

The ps} choacoustic model is used to calculate the perceptual thieshold foi each lime 
segment of the source material This threshold which estimates the maximum noise that 
can be in audibly inserted into the signal is used in the coder to maximize the amount 
of perceptually acceptable noise that can be inseited into a given signal segment A.s 
the amount of the quantization noise is diiectlv related to the number of bits used b\ 
the quantizer the bit allocation algorithm assigns the available bits in a maiinei which 
minimizes the audible distortion 
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Chapter 4 


Implementation Of Perceptual 
Transform Coder 

4 1 Why Transform Coder‘s 

The Frequency Domain’ approach which exploits linear dependencies between inpul sain 
pies foi cfhcient digitization is c riled Transform Coding In this thesis Transfoi ni C oding 
IS us( d to ichieve the desired bit rate reduction for it is moie robust than DPC M nitli re 
gaid to input statistics and channel eriors The piice paid for these advantages is intieased 
encoding delay which is proportional to tiansfoim length N and an encoding i oinpliMh 
in th( case with adaptive bit assignment 

Fig 4 1(a) IS a block diagram of transform coding It is i vaveform digitizing pioctduie 
where i block of N input samples x[n] is lineaily transformed into a set of N tnnsfoim 
coefhcients 6>[n] The cofhcnnts are then quantized foi transmission and i leconstiuc 
tioii of \[n] is obtained at the receiver using an inverse tiansform operation on qumtized 
coefficients 

Fig 4 1(b) shows how the above coding scheme is implemented The input blocl is 
high resolution PCM samples The output of the coder is the combination of the outputs 
of ’IN PCM codeis that convey quantized uncorrelated coefficient information hpicallv 
with 1 much lover total bit rate than what is present at the coder input 
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TRANSMITFER CH \NNCL RECEIVER 



Figuic 4 1 (a)Transform Coder (b)Implementation of Encoder 

The efficienc} of a transfoim coding system y ill depend on the type of lineai ti uisfoiin 
and the nituie of bit allocation foi quantizing transfoim coefficients Most pi irtic d s\s 
terns will be based on Disciete Fouiiei Transfoim(DFT) approach 
DFT IS defined by 


N-l 


0[}] = ^ A =0 1 2, iV-1 

n=0 

%[n\ = l/A^ E' n = 0 1 2 - 1 


(11) 

( 12 ) 


DFT lepiesentations ha\e tyo important adyantages 

• DFT makes of sine cosine basis space Hence it pioyides a natural fiarneyoik foi 
optimwations of transfoim coder design with inputs where the perceptual effects of 
signal distortion are best understood in the frequency domain 
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• DFT IS advantageous with le&pect to computation also A. diiect evaluation of an \ 
point DFT lequiiGS about complex multiph add opeiations but if N is i pimf i of 
2 the Fast Fouiiei Tiansfoim(FFT) method needs in the ordei of iYlogMipti itioiis 
With N=i024 the FFT is about one hundred times fastei than a diiect coinpul itioii 
111 the case of leal valued inputs fuithei reductions are possible b\ exploiting the 
piopeities of the DFT 

• Th( FFT algoiithm can also be used to eialuate the imeise opeiation 

4 2 Implementation Of Transform Coder 

A blocl diagiam of the Transform Codei as implemented using Matlab package on Pen 
tmrn PC (cs 133 MHz, is shown m Fig 4 2 The source material is 16 bit PCM inonophomc 
audio signil of 15 KHz bandvidth sampled at 32 KHz The window tint is used in 
the overlap sectrons is the sqauie root of a Hanning window of length equal to twice flu 
oveilap that is 256 data points since the windowing process for overlap add and luihsis 
pi o\ ides for an oveilap add of Kth This overlap add window is shown in Fig 4 3 for I line 
successive frames wheie each frame contains 2048 data points By allowing tins cncilip 
between (he successiie Fames reduction of end effect noise is achieved without signih- 
canth lowreimg the number of bits available for encoding each frame With the ovc il ip of 
Kth the numbei of new data points m each fiame is 1920 

The TFT is implemented diiectly on the windowed data The 20 18 real input d ita iic 
coiiveited to a spectrum of 1024 complex points counting the dc and Nyquist fiequcnc^ 
as one complex point The spectral peaks of each 8 complex points are cilculitcd Sinci 
the particular transform is a real complex tiansform of length 2048 there are 1024 unique 
complex pans and 128 spectral peak values These spectral peak lalues aie quintized 
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TO CHANNEL 


Figuie 4 2 Block Diagram Of Transform Coder 
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Sample n mber 

Figure 4 3 Overlap add window 

using 8 bits wlieie each step size in the quantizer represents ^ = 0 664dB The icasoii 
for choosing the numbei 170 is as follows 

The size of the quantizei is dcteimined by the possible dj namic range of the 2018 iioint 
spectium given a 16 bit PCM input souice In a tiansfoim the total specti il chn uuic 
range can vaiy fiom that of a unit impulse here set to quantizer level 1 to tin spectial 
eiieigy ol i full scale siiiewave roughly 167 dB higher hence the total range of 170 clB 
Then the numbei of bits wailable foi coding the samples aie calculated 
It leciuiies the calculation of the following 

• The total numbei of bits available m a block of data 

• The numbei of side mfoimation bits required The side mfoimation is used m both 
the tiinsmittei and leceivei to calculate bit allocation 
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Since { ich block contains 1920 new data samples of 8 bits per sample the total niiinbf i 
of bits u nlable comes out to be 15360 
The side iiifoimation consists of three parts 


• A. 16 bit PCM encoding of the dc term from the FFT 


• The thieshold is encoded m 8 bit PCM where each step size is 0 664 dB 


• The 128 spectral peaks are encoded m 8 bit PCM using step size of 0 664 dB 

By t living into consideration the above details the number of bits required toi sich iiiloi 
mation is calculated to be 1040 This number is then subtrreted from the total nuinbei of 
available bits to get the number of bits available for coding the samples 

A.ftei cilculatmg the actual number of bits available vliich is 14320 bits the numbei 
of levels in e ich set of qumtizeis is found by iterative procedure bv calculating 

= 2 * mnt{Pi/Thrj) + 1 (4 i) 

where 

= numbei of levels in each set of quantizers 
P = quaiiti/ed peak level 
Thij= qu\nti/ed thieshold value 
nmt = nearest integer function 
j = Bark frequency, varying from 1 to 24 

In this thesis thieshold is initially assumed to be a constant and the numbei ol h \( Is in 
each set of quantizers is then calculated From the number of levels the numbei of bits 
IS calculated and compared with the total numbei of bits available to encode a block of 
data samples If both numbers are not equal the threshold is adjusted accordmgh and / 

IS computed This process is repeated until all available bits have been used 
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The quaiiiized samples along with the side information ha^ to be transmitted to the 
receiver and rcconstiuction of the input signal x[n] is obtained at the receiver using inverse 
transform operation on quantized samples and the results windowed with the same window 
used for analysis Fig 4 4 shows the reconstructed signal of Transform Coder for 16 bit 
PCM input and Fig 4 5 shows the 8 bit PCM Coder output for the same input 




Figure 4 4 (a) 16 bit PCM input (b) Transform Coder output 

Listei mg to the reconstructed audio output reveals that while the 8 bit PCM coded 
segment is noticeably noisy the 8 bits per sample transform coded segment cannot be 
distinguished fiom the original 

Also to compare their performance, Signal-to Noise ratio(SNR) is computed by first 
finding their respective quantization errors SNR computed in decibels for a block of input 
audio signal is as follows 

SNR = 19 74 dB For Transform Coder 

SNR = 14 95 dB For 8 bit PCM coder 

CENTRAL library 

I I T KANPUt 






Figure 4 5 (a) 16 bit PCM input (b) 8 bit PCM Coder output 


The Tliieshold viluc used in Eqn 4 3 can be calculated using peiceptinl noise (iitcin 
Several steps arc involved in calculating the masking threshold of which Cntical Band 
Analysts using Bark spectrum and Spectral Flatness Measure are discussed 

4 3 Critical Band Analysis 

The block diagram of Perceptual Transform Coder is shown m Fig 4 6 
Prom the 1024 imiqiu complex pans computed eailiei (by windowing the input signal and 
processing by FFT), the powei spec ti urn is calculated using 

P{uj) = Re^{u}) + Irn{(jj) (4 4) 

The powei spectrum is shown in Fig 4 9 The spectrum is then partitioned into critical 
bands accoiding[3] to Table 3 1, and the energy in each critical band is summed 
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Figuic 4 6 Block Diagram Of Perceptual Transform Coder 
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( 15 ) 


bh 

A == E Pi^) 

tj—bl 

whcie 

bl = lo-wei boundaiy of critic il bind i 
bhi = uppei bound xiy of ciitical band i 
Bl — cneigv in ciitical band i 



Frequency in Hj 

Figure 4 7 Power Spectrum 

Fig 1 10 shows power spectrum md critical band spectrum (Bark Spectrum) for a 
block of input signal As discussed m section 3 1, critical band rate scale follows i linen 
scale up to about 500 Hz and then a logarithmic frequency scale above 500 Hz 
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Figure 4 8 Power and Bark Spectrum 

4 4 Spectral Flatness Measure 

In oidei to detcimnu Uu noise like oi tone like nature of the signal, the Spectial Flafne'sh 
Measure(SFM) is used The SFIVI is defined as the ratio of the geometric mean(Cm) of 
the powci spectium to the aiithmetic mean(Am) of the powei spectium 



SFM — Pk) ^ 


(IG) 


This latio, by definition is not gieitei than unity SFM in decibels is given by 

sm,B = 10 logic ^ e") 

SFM IS thin used to generate a coefficient of tonality a, to determine whether the signal 
IS entiiely tonclike oi noise like a is given by 
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n = yi)m 


1 


( srM,B 

KSPM, 


(IB mar 


(4 8) 


With SrMdBmax = -QOclB SFM obtained with thiee successive frames of IG bit PCM 
monophonic audio signal of 15 KH/ bandwidth sampled at 32 KHz are as tabulated as 
follows 


Framt No 

SFM clB 

a 

Fond 

/nontonal 

1 

35 577 

0 59 

Tonal 

2 

23 560 

0 39 

Tonal 

3 

32 021 

0 53 

Tonal 


T able 4 1 Tonality Measure 


The complete code (hat is wiittcn using Matlab Package on Pentium PC (e 133 MHz 
foi implement itioii of the H uisfoim Coda and foi computing the Spectral Platnes Mea 
sine and Baik Spcdiiim is given in Appendix 
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Chapter 5 


Future Scope Of The Thesis 

5 1 Scope Of The Thesis 

The riensloini Code is usm^ peucptuil noist (iiterufor efficient coding of steicoplioiiu 
ludio sign ils IS uiidciwiY Mok icceiit woik suggest that the Perceptual Transfoiiii Code i 
could b( used x1 x lowci bit i xto of 3 bits per sample foi tiansparent coding of sigiixl 
souiccs E\ xiniii xi 1011 of soiiu 20 KII/ bxndwidth signals sampled at 44 1 KHz suggest 
thxt tin sxiiK 01 slightly fiwci bits pci sample should piovide tiansparent coding Tin 
leasoii IS thxt the lowei eneigy xt vexy high fiequencies requires fewer than aveiage bits 
foi the addition xl bxndwKlth xe suiting m the lowei rate measured in bits pei sample 
Woik xelxted to iiieie xsed eoclmg etheicncy, cffieient steieo encoding and nioie (ffpctixe 
compiession/p xekmg xlgoxitlims is undeiwxy 

5 2 Introduction To DAB 

5 2 1 Why DAB System*? 

The use of digit xl 1 1 aiisiiiissioii teeliniques for the teiiestiial broadcasting of radio piogi xnis 
was picyiously pit vented by the laigc bandwidth lequnoments that is, high fieciuenex de 
mantis As x icsult of modem audio compiession techniques which exploit the psuhoa 
coiistic piopeities of the human eai the present state of the ait is capable of achiexing 
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consicki iblc icdiulions m tlu bit i\t( ti xnsmittcd Tlioiefoie an oppoitunitj will aiisc foi 
tnnsmittiiit, i diyl dly tncodid sleieo piogiam m in FM channel bandwidth Fhc m uii 
obiectivp ol DAB(Digil il Audio Bio idcislmg) is to ichievc erioi fiee signal 1i iiisimssion 
with b( ltd ludioquility uid x ininimum use ol limdwidth Audio quality must b( in 
impiovducnt ovci the Insu dm \ctciistics(su(h is noise and distoit ion) olFM whose i idio 
clnimels no highly imp me d by inultipith leccptioii Then foie DAB would be in efhcidit 
successoi ol the teiiestiial FM bioadcasts bj occupying the VHF band of 87 5 to 108 
MH/ without leising[9] the e ipaeity to bioadcist the existing numbei of FM piogianis 
R idio eh lime Is using digit il ti insmission techniques will peiinit the opeiatioii ol single 
heciuciKy iKlwniiks m (he luluic riiese digitil single hcquency netwoiks will peiinit 
subst lull d lieeiiu ne y e e oiiouiy Bio ide isting i i leho piogiam thioughout a se n ice iie i 
only leeiunes enie single li uisiuissie)u e)iinnel[2] oi i put ol i conespoiiding multiplex 
signil The lugeilhe seiviee ue i, the less fie ciuency volume is icquiied loi bio idc istiiig i 
piogi uii Foi this le iseui DAB will iceiiuie only ibout one third ol the licciueuc\ lolume 
to biouleast piogi ims, eomiiued with FM toehy In oidei to leuse frequence blocls as 
often IS possilile uiel eliie to eitliei physie il eoiiditions lelatmg to digital single he queue e 
netwoiks tiuismiltd peiwei eeniip lie d wath FM cm be i educed considci ably Onh low 
triiisinittei peiwei pe units the leusc ol the same fiequencv block at distances that aie not 
too gie it Moie eive i ehie etioii il ml emus with levels of up to 20 dB must be used it the 
edges ol seniee ue is te) iiimmu/e leeluecel i idiitioii pitteiiis 

5 2 2 Merits Of The DAB System 

• The D\B single liceiueney ne (wenks wall exploit the icsoiiices ol wneless timsmis 
Sion with exti loidmuy efheieiicy This applies m particulai to progiiins which aie 
blende 1 st nationwiele oi loi luge seiviee areas 
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. DAB will ilso piovKk hskiuisinllu long Uim with i much gieatei piogiam miu 
foi IciKstiiil utcplioii with mobile uicl poit iblo laclios Full covemge broidf istmg 
ol piopi uiis ilnl u( (luu litc uid ol (xcdknt quility \m11 be ensuied it tin slut 
ol DAB dluf IS m luluu, DAB will Ikiouk (he CD RADIO 
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Appendix 


/ Transform Coder 

N=2048 

m=input( Enter the number of frames ’) 
fidl=fopen( ’m tim’ ’r’) 
dum = fread(fidl 20 ’ short 0 
X = fread(fidl 2048*m 'short') 
xl=reshape(x N m) 

/ banning window 

for cl=l 266 
g(cl)=cos (2>t-pi*c 1/257) 
h(cl)=sqrt(0 5*(l“g(cl))) 
end 

t=ones(l 1792) 
u=t(l ) 
w(129 1920)=u 
for c2=l 128 

w(c2)=h(c2) 

w(1920+c2)=h(128+c2) 

end 

c=l 2048 
for 1=1 m 
W(c)=w 

F(1 2048)=W(c), 

K =xl(l N 1) *F' , 

ft=fft(K N) 

reft=ft 

recp=real(ft) 

imcp=iraag(ft) 

ft(l)=ft(l)+ft(1025)*i 

sp=ft(l 1024) 

ft (1025 2048) = [], 
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re=real(sp) 

im=iinag(sp) 

/ Spectral peak calculation & Bit rate calculation 

sr=abs (re) 
si=abs(im) 
sql=0 0 
thj=7 02 
count=l 8 
for ind=l 128 

srm=max(sr (count) ) 
siin=max (si (count) ) 
p(ind)=max(srm sim) 
pdb ( ind) =20*logl0 (p ( ind) ) 
pquan(ind)=0 664*round( (pdb(ind) /O 664)) 
pval(ind)=iO'' (pquan(ind) /20) 
ql ( ind) =2*round (pval ( 3 nd) /thj ) +1 
nbt (ind) =round(iog2 (ql (ind) ) ) 
count=count+8 
end 

jj=l 16 

for 11=1 128 
for jl=jj 

rec(jl)=nbt(ii) 

end 

end 

sql=sura(ql) , 
totb=suin(rec) , 
while totb > 14320 
thj=thj=*=l 006 
for idl=l 128, 
ql(idl)=0 0 
nbt(idl)=0 0 
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end 

for idl=l 128 

ql(idl)=2-^round(pval(idl)/thj)+l 
nbt (idl)=round(log2(ql(idl))) 

end 

j2=l 16 
for il=l 128 
for j3=j2 

rec(j3)=nbt(il) 

end 

j2=j2+16 

end 

sql=sum(ql) 

totb=sum(iec) 

end 

while totb < 14320 
thj=thj/l 002 
for id2=l 128, 
ql(id2)=0 0 
nbt(id2)=0 0 

end 

for id2=l 128, 

ql ( id2) =2-+'round (pval (id2) /thj )+l 
nbt (id2) =round(log2 (ql(id2) ) ) 
end 

j4“l 16 
for i2=l 128 
for 35*34 

rec(35)=nbt(i2) 

end 

34=34+16, 

end 

sql=sura(ql) 
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totb=sum(rec) 


end 

lev=2 rec 
13=1 16 
for i4=l 128 
for i6=33 

nlev (i4) =max (rec (i3) ) 

end 

i3=i3+16 

end 

for i6=l ] 28 

Bfep(i6) = (2*pv ii(r6))/2 (nlev(i6)) 
end 

j6=l 16 
for ilO=l 128 
for j/'=j6 

ssi^(j7)=step(il0) 

end 

j6=j6+16 

end 

retrs=recp' , 
imtrs=iincp' 
for x7=1 2048 

requanCi7) =381/ ( 3 1 ) 'Ground (retrs (i7) /ssiz (i7) ) 
imquan(i/')='8Siz(a7)*round(imtrs(i7)/ssiz(i7)) 
end 

for ill=l 2048 

redat(ill)»Crequan(ill)) + (imquan(ill)*i) , 
end 

redatt«conj (redat ’ ) 
s“ifft(redatt,N) 
e=realCs) , 
rw=e 
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fid 2 =fopen(’ Ic di.L' *'i’) 

count! =iprinl J: (( i.d2 ”/b 4f\ii rw) 
rwL=rw' 

/ J6 bit PCM to ’b’ bit PCM conversion 
b=input('Lnlc'i Lho numboi ol bits ’) 
step=(2*32767)/(2''b) 
for il2=l 2048 

bit8(iJ2)=stGp>t-round(xl(il2 l)/step) 
end 

bitt=bifc8' 

fid3=J oponC ' c odor d il ’ ' i’ ) 
coimL2=fpt iiU ! (I ul ! '“/b 4(\n' batt) 

'/pauso 
c=c^ 1920 

“/sound (xl ^2000) 

/pause (b) 

/sound (rw 32000) 
fcloseC'ill ’) 

[ 

figured ) 

subplot (1 2,1) plot(xl(l 2048 1)) 

xlabelC ’ Sample Numbot’) ylabeK’ Amplitude’) 

subplot (1 2 2) plotCrwCl 2048)), 

xlabaK ’SampLo Numbot ’ ) .yLaboK’ Amplitude') , 

figure (2) , 

subplod(l,2 1) plot(x1(l 2048,1)), 
xlabeK 'Sample Niimboi’) ylabeK 'Amplitude') 
subplotCl 2 2) plot(bitt(l 2048)) 
xlabeK' Sample Number ’) ylabeK 'Amplitude') 

'/.print txcod -ffigureCl) "deps2 
'/printf cod8 “ff3gure(2) '•deps2 
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/ PticopLinl Iransform coder 


N=2048 

fs=32000 

in==input( ’tnLer tho numbc r oi frxmos >) 

fidl=fopen( 'm tim' 'i') 

dum = freadClidJ 20 'bhort') 

X = freadClidl 2048'^•m ’shoji') 
xl=reshipe(x N m) 

’/ hmninir window 

foi cl“J 2‘)() 
g(cl)=cos(2*pi*( 1/ 
h(cl)=sqrt(0 ')K1 p(c 1))) 
end 

t=ones(J 1^92) 
u=t(l ) 
w(129 1920)=u 
for c2=i 128, 

w(c2)-h(c ’) 
w(1920Hc2)=h(128+c2) 
end 

c-1 2048, 

'/( ffO & pad c.ihiiJ It ton 
for 1=J m 
W(c)=w, 

F(1 2048)“W(c) 

K =xJ(l N.l) n ’ 
ft=fft(K,N) , 

sp«ft(i 1024) 
ft(102B 2048)an , 
freq=16000*(0 1023)/J024, 
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fqt^licq’ 
re=rodl ( p) 
im=imaf (^p) 
ypw=(3G) ) t (im) 
pw=(sp •<<coni( p)')/hiiptlU p) 

pwdb=iO'<‘loglO(pw) 
fid2=fopcn( 'pw (lit ’w’) 
fprintfCridi ’"/b \l\t/() -11 \n' (qt pw) 

figure (1) 
plot(fi(:-q pwdb) 

xlabcK'I r( qiu'iu Y inf! ) ylibiK’Powoi in db') 

*/ t 1 1 1 u I It uMi Hi { H mu ify 

fidS^foponC ’ I tin tl il * t ' ) 
lt=fscant (1 id I 'Xdd ) 
c3=l ? 
cnt=lt(c3) 

B=zeros(l ?A) 
slp=0 0 

fid4=fop0n( ’ CqJ im d U ’ 'i ) 
llt=fscdni Cf ul4 ’*/')d’) 
cc3=J ?, 
ccnt»llt (cc 

/or c4 I M, 

lor ib cut (13 cnt(2) , 
tip tLptpwCcb) 
and 

for rcb tcnt(i) cciitC?), 
bkCccb) -sip 
snd 

cc3«cc3+l 
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OC Til ill (cc i) 

R(t<l) ip 
Ip 0 0 
c H-i 311 
cnt U (c 3) 
c 111 ( I ) c n t ( n I I 
c4”c <1 1 1 

end 

bdb"10*'lop lO(bk) 

flgUTcC 

plolCltiuj [)vnlh 'y ' 1 IbbOO bdb ’r-') 
floxK'PowoT Sp(HHum') ft oxt ( >Buk Spectrum') 
xJibol ( ’I 1 (UiuotK y in II ’) y libel ('Powei in db') 


'/. c ilculalion of SFM 

d3v=i/lcnf t h(pw) 

gm=0 0 

for cc8»=i iOM 

gm«gm+iO^ loflOCpwCrca) ) 
end 

Gm'=diV'*'gm, y Geometric mean 

AmalO*logJO(incuuCpw) ) */, Arithmetric mean 

SFM*Gm“Am 

aLfa®min ( (SI M/C 00)) 1) ’/ tonality index 

fclosoC’ali’ ) 
c«c+1920 

1 

figure (3) 

plotCl 2048, w >y ‘ 1921 3968, w 'y-^3841 5888, w 'y-O 
xlabel ( ' Sample number ’ ) , ylabe) ( ’ Amplitude ’ ) , 

'/print psp "fiigureCD -d 0 pa 2 , 
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/print bsp -ffigure(2) -deps2 
/print win -ffigureO) -deps2 


/ SNR Calculation 

fidl=fopen( ’ra tim’ ‘r‘) 
dum = lread(fidi 20 short’) 

X = fread(fidl 8000 ’short’) 
fid2=fopen( ’tc dat’ ’r’) 
tout=fscanf (f id2 ’/f’) 
fid3=fopen( ’ coder dat’ ’r ) 
codout=f scanf (f xd3 ’’/f’) 
ltc=length(tout) 
inps=x(200 Itc) "2 
avin=(sum(inps)) /(ltc-200) 
qetc=x(200 Itc) -tout (200 Itc) 
sqtc=qetc "2 

avtc=(sum(sqtc) )/ (ltc-199) 
snrval=avin/avtc 
snrtc=10*logl0 (snrval) 
qecod=x(200 ltc)-codout(200 Itc) 
sqcod=qecod "2 

avcod= (sum (sqcod) )/ (ltc-199) 
snrcod=avin/avcod 
snrSslO’HlogiO (snrcod) 
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